Network Analysis of the State Space of Discrete Dynamical Systems 
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We study networks representing the dynamics of elementary 1-d cellular automata (CA) on finite 
lattices. We analyze scaling behaviors of both local and global network properties as a function of 
system size. The scaling of the largest node in-degree is obtained analytically for a variety of CA 
including rules 22, 54 and 110. We further define the path diversity as a global network measure. The 
co-appearance of non-trivial scaling in both hub size and path diversity separates simple dynamics 
from the more complex behaviors typically found in Wolfram's Class IV and some Class III CA. 
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As physical theories widen into the biological and so- 
cial realms, the problem of characterizing complex dy- 
namical systems becomes increasingly important. Even 
elementary systems, such as cellular automata (CA), 
originally proposed by von Neumann often exhibit 
dynamical patterns that pose conceptual challenges and 
serve as test-beds for techniques to study more realistic 
phenomena. To date, attempts to classify the behav- 
ior of dynamical systems such as CA have been based 
on various definitions of complexity and have largely fo- 
cused onpatterns generated in space and time (see, e.g., 
Ref. Here we take a different approach, 

focusing on statistical properties of state space networks. 

The trajectories of a discrete dynamical system form a 
directed network in state space, wherein each node, rep- 
resenting a state, is the source of a link that points to its 
dynamical successor Q. For deterministic systems, each 
node has a single outgoing link (each out-degree is equal 
to 1). For irreversible systems, states may have different 
numbers of pre-images and thus different in-degrees. In 
this paper we present analytical and numerical studies 
of state space networks of various one-dimensional CA. 
This network perspective reveals previously unrecognized 
scaling behaviors and suggests a new measure of an as- 
pect of complexity that we term "path diversity." Since 
the CA we study are known to produce a wide variety 
of different dynamical behaviors, our results are relevant 
for understanding discrete dynamical systems in general. 

We examine 1-d binary CA with nearest neighbor in- 
teractions and periodic boundary conditions. Wolfram 
put these CA into four complexity classes [§| based on 
the qualitative appearance of spatio-temporal patterns 
produced from random initial conditions for large lattice 
size L. The four classes are: (I) the system almost always 
evolves quickly to a unique fixed point; (II) it almost al- 
ways evolves quickly to one of many attractors with a 
small period; (III) it generates seemingly random pat- 
terns with small-scale structures; (IV) it shows a mixture 
of order and randomness with long characteristic times. 
One class IV CA, Rule 110 (defined below), has been 
shown to emulate a universal Turing machine 0. One 



shortcoming of this classification is that the border be- 
tween classes III and IV is ill-defined. In fact, the classifi- 
cation of some rules (such as rule 54) is still disputed, and 
misclassifications can happen due to e.g. subtle and slow 
dynamics hidden beneath clear chaotic behavior. Exam- 
ples include rule 18, where annihilating random walks are 
embedded in a random pattern with small-scale struc- 
ture [To| , and rule 22 which shows random patterns with 
slow but (highly) statistically significant decrease of en- 
tropy with time and with L [11| . Other classifications 
have been suggested, but a definitive criterion for com- 
plex dynamics has not yet emerged [H, [ID, [l4| • 

In the present Letter we report numerical and ana- 
lytical results showing that class IV and some class III 
CA exhibit highly heterogeneous state space networks, 
unlike the networks corresponding to the simple CA in 
class I and II. The heterogeneity is reflected in local prop- 
erties, including broad in-degree distributions and finite- 
size scaling of the largest in-degree. We show, however, 
that this type of local heterogeneity can also occur in 
CA with simple dynamics. On the other hand, a global 
measure, termed the path diversity, by itself cannot dis- 
tinguish simple from complex CA either. However, it 
tends to show trivial behavior in simple CA where the 
local measures are non-trivial. In fact, the complex rules 
show non-trivial scaling behavior in both the local mea- 
sures and in the path diversity. On the basis of these 
observations, we speculate that network heterogeneity at 
multiple levels may be a generic property of the state 
space of complex dynamical systems. 



Let R denote the CA rule. The binary value 



of 



site i at time t + 1 is set equal to iZ(s*_ x s*s* +1 ) evalu- 
ated at the previous time t. We use periodic boundary 
conditions so that spatial indices are taken modulo L. 
To label the CA rules, we use Wolfram's scheme that 
identifies each rule with the number #(000) + 2#(001) + 
2 2 #(010) + 2 3 #(011) + 2 4 #(100) + 2 5 #(101) + 2 6 #(110) + 
2 7 #(111). Hence, for example, Rule 18 is the one for 
which #(100) = 1 and #(001) = 1 and R{s lS2 s 3 ) = for 
all other triples. 

A binary CA of size L has N = 2 L different states 
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S<°> = (4 a) ---Si-i) ; a = 0,...,N-l. These may 
be viewed as the nodes of a directed network, where a 
link from to indicates that R maps to S<- b \ 
When such a link exists, we say that is a pre-image of 
and the number of pre- images of is its in-degree 
k(S^). The network typically consists of disconnected 
clusters, each containing transient states and a recurrent 
dynamical attractor. Garden of Eden (GoE) states are 
transient states with zero in-degree. Pictures of some 
state space clusters are shown in Fig. [1] for more, see Q ■ 
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FIG. 2: (color on-line) The largest in-degree k max as a func- 
tion of N = 2 L . Except for class III rules 30, 45 and 150, 
all CA shown here exhibit clear scaling of the largest hub 
size k m ax with the total number of nodes in the network, N. 
Rule 160 is class I, 4 and 50 are class II, and 110 is class 
IV. Rules 18, 22, and 54 are between III and IV, as they 
have large structures masked by small-scale chaos (18, 22) or 
structures on intermediate scales (54). The analytic values of 
v are 0.8114, 0.6942, 0.5515, 0.4057, 0.3471, 0.4057 and 0.6942 
for rules 4, 18, 22, 50, 54, 110 and 160. These values are in 
perfect agreement with the numerical results. 



FIG. 1: One connected cluster out of each state space net- 
work for different CA, plotted with the program 'Pajek' 
(vlado.fmf.uni-lj.si/pub/networks/pajek/). All arrows point 
from leaves towards the center. Sizes are L — 10 to 13. For 
Rule 4, the network heterogeneity resides entirely in the size 
distribution of the different clusters making the state space 
network. 



These state space networks can be analyzed by a vari- 
ety of statistical measures - e.g. their degree distribution, 
clustering coefficients, etc. [la, Ha. Many real-world 
networks (e.g. regulatory networks [17| . the world-wide 
web [HI , and earthquakes [l|| ) differ markedly from ran- 
dom graphs (where degree distributions are Poissonian 
and clustering is absent), displaying "fat-tailed" or even 
scale-free degree distributions. In the following we show 
similar results also for state space networks. 

Fig. [5] demonstrates that several CA networks exhibit 
clean scaling for a particular local property, the in-degree 
of the largest hub, k max ~ N" . Scaling sets in already for 
rather small lattices and holds also for the second, third, 
etc., largest hub (data not shown). The rules shown in 
Fig. [5] were chosen to cover the entire spectrum of known 
behaviors for 1-d elementary CA. 

This scaling, including the value of v, can be derived 
exactly, provided one knows the structure of the hub state 
H. The latter can be guessed easily for all CA shown in 



Fig. [21 while it is less obvious for others. For all ele- 
mentary CA, the hub state is either periodic or - if the 
period does not match the lattice size - periodic with a 
few defects. 

For rule 18, e.g., one finds numerically that Ji = 
(00 • • • 00) for all L. According to the definition of CA 
18 given above, all sequences that do not contain 001 or 
100 as substrings are pre-images of H. 
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FIG. 3: Walks on the graph shown on the left generate all pre- 
images of the hub state Ti for rule 18. Each step corresponds 
to an element of the matrix T shown on the right, with rows 
and columns labeled in the order 00, 01, 10, 11. 

The number of distinct strings of length £ without '001' 
or '100' is equal to the number of walks with £ — 2 steps 
on the graph shown in Fig. [3] The number of periodic 
strings of length L is then given by TrT L « N log ^ Al , 
where T is the matrix shown on the right of Fig. [31 and Ai 
is its largest eigenvalue. This gives k max ~ N v with v = 
log 2 Ai = 0.6942, in perfect agreement with the numerical 
results. 

Similar analytic arguments hold for all the other CA 
that exhibit scaling in Fig. [5] When the hub state con- 
tains both '0's and T's, one has to introduce two transfer 
matrices T^ ^ and where maps each pair s t i _ 1 s t i 
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onto the pair s*s*. x , iff s- +1 



R(s\ 



s. The 



labeling of rows and columns is as in Fig.[3J The in-degree 
of any state S is then 



fc(S) =Tr(T (so) ---T^- 1 )) 



(1) 



The resulting exponents v are cited in the caption for 
Fig. [21 The same scaling (with the same exponents) holds 
also for the second, third, etc. largest hub. 

Another interesting property of rules giving large hubs 
is the scaling of the in-degree distribution function P(k) 
with system size. Fig.|4]shows the data collapse for Rules 
4, 22, and 110 found using the multiscaling ansatz [2(| 



logP(yfc) = logN /(log k/ log N) . 



(2) 



It is significantly better than the usual finite size power- 
law scaling, which would produce straight lines in Fig. [4] 
To decide whether the apparent curvature is a finite size 
effect or a true indication of multiscaling is in general not 
easy, but it can be done analytically for the simple CA, 
Rule 4. 
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FIG. 4: (color on-line) In-degree distribution functions col- 
lapsed for simple and complex rules using a multiscaling 
ansatz for rules 4, 22, and 110 for different system sizes. The 
black solid line is Eq. (0. The distributions were shifted by 
0.5 units each for clarity. Rule 22 is consistent with a power 
law P(k) ~ AT 7 with 7 w 2.8. 



For Rule 4, the sequence . . . 11 . . . has no pre-image and 
the sequence . . . 1 . . . has the unique pre-image . . . 010 . . .. 
T^ 1 ) can then be written as eje 3 , where e 2 = (0,1, 0, 0) 
and e 3 = (0,0,1,0). The in-degree of any state S can 
then be expressed as 



n^K)=n e ^ T(o) ] m * 4, 



(3) 



where 



T (o) 



( 1 1 

1 

110 

\0 1 1 



(4) 



the product runs over all 'l's in S, and m,i is the number 
of '0's following the ith '1' in S. 



For large to we have w(m) w a\ m , where A = 1.75488 
is the largest eigenvalue of T^ ^ and a = 0.234487 = 
(e2 • u)(v • 63), u and v being the right and left eigen- 
vectors corresponding to A, normalized to v • u = 1. Any 
state containing n isolated Is will then have an in-degree 



Although occurrences of 



small rrii can only be neglected for n <C L, we find em- 
pirically that this formula reflects the qualitative behav- 
ior for larger n. To find P(k), we now need to find 
Q(n), the number of states with n isolated Is and no 
pairs '11', and transform its dependence on n into a de- 
pendence on k. On a periodic lattice of length L, one 
has fi(n) = C(L — n,n) + C(L — n — l,n— 1), where 
C(£,m) = £\/[ml{£ — to)!]. To obtain an approximation 
for P(k) we first invert the above relation between k n 
and n, giving n(k) = (Lin A — lnfc)/[ln(A/a)], then use 
P(k) w 2- L Q(n)\dn/dk\. 

The scaling ansatz shown in Fig. U is indeed recovered 
by this approximation. To see this, we define 



x = lnfc/lnAT, y = lnP(fc)/ln N 



With this choice, we have 



n 
L 



In A — x In 2 
ln(A/a) 



(5) 



(6) 



Neglecting a term In ln(A/a)/(L In 2) in y, using Stirling's 
formula, and taking the large L limit of In[f2(n)]/L for 
fixed x, we get 



y 



-1 



i°g 2 



(i 



e«(l-2e) 



(7) 



Here e = n/L is a function of x through Eq. (|6|). Eq. (|7|) 
is shown as a solid line in Fig. 2] The curvature of this 
line clearly indicates that no substantial range exists over 
which the distribution is a power law. Using a Fourier 
method based on recursion relations, we have also deter- 
mined numerically the exact distribution for L = 10, 000. 
It shows slightly enhanced curvature for small x, and is 
for large x in excellent agreement with the above approx- 
imation. Hence we conclude that the apparent curvature 
seen in Fig. 4 for Rules 4 and 110, is not simply a finite 
size effect but more likely an indication of multiscaling. 

As Fig. 1 indicates, rule 4 does not exhibit hetero- 
geneity beyond the single node level. All transients have 
length 1, all attractors are simple fixed points, and the 
only 'complex' aspect is the broad in-degree distribution 
of the latter. Clearly, the in-degree distribution of any 
set of nodes, being a strictly local construct, cannot by 
itself distinguish complex CA from trivial ones. To make 
this distinction we introduce a new quantity, the path 
diversity T>. It measures fluctuations in the set of dif- 
ferent paths connecting the GoE states to attractors. If 
one projects all attractor states into a single node, then 
the state space network of a CA becomes a rooted tree. 
Roughly, T> counts the number of non-equivalent choices 
encountered by following each path from an attractor 
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FIG. 5: (color on-line) The path diversity T> of different CA. 
Straight lines indicate scaling T> ~ N 5 . Fitted values of 5 are 
0.88, 0.88, 0.85, 0.75 and 0.72 for rales 110, 22, 54, 18 and 30, 
respectively. 



(root) to a GoE state (leaf). Path diversity bears re- 
semblances to tree diversity 0] and topological depth Q • 

We first define the path diversity V for each tran- 
sient node: A GoE state has diversity one; a transient 
state with a single pre-image has the same diversity as 
its unique pre-image; and the diversity of a node with 
more than one incoming link is the sum of all distinct 
diversities of its pre-images plus one. Thus, if a node has 
e.g. in-degree 5 and three of its pre-images have diver- 
sity 2, one has diversity 6, and the last has diversity 17, 
then that node's diversity is 2 + 6 + 17 + 1 = 26. Finally, 
the path diversity T> of the entire CA is computed by 
joining all attractor states (in all disconnected compo- 
nents, if there are several) into one single "meta-state" , 
and applying the above scheme to the meta-state. 

In Fig. [51 2? is shown for several CA. It is sensitive to 
aspects of the network's structure that are different from 
the node degree distribution. As a result, it clearly sep- 
arates rules 4 (where T> = 2 for all L) and 150 (where T> 
seems to remain bounded) from other CA. The most in- 
teresting rules are those which show clear scaling T> ~ N s 
with S close to 1, e.g. S = 0.88 ± 0.01 (rules 22, 110), 



0.85±0.01 (rule 54), 0.75±0.04 (rule 18), and 0.72±0.03 
(rule 30). For the system sizes studied, rules 50 and 160 
appear to show scaling with smaller <5, though both would 
be classified as simple (classes I or II) by Wolfram. An an- 
alytical calculation of upper bounds on transient lengths 
shows, however, that T> actually grows slower than any 
positive power of N for rule 160, and we expect the same 
to hold for rule 50 given its similar numerics. The oscil- 
lations seen in Fig. [5] for some rules are due to the global 
constraint imposed by periodic boundary conditions. In 
the most extreme case, rule 45 has nontrivial transients 
for odd L but none at all for even L. 

In general, we neither expect that a single observable 
can reliably distinguish "complex" from other behavior, 
nor that a few discrete classes can do justice to the mul- 
tifarious ways in which a system can display complexity. 
This is supported by our analysis. Neither the scaling 
of the hub sizes with system size (or the scaling of the 
in-degree distribution) nor the scaling of the path di- 
versity can by itself distinguish between "simple" (Wol- 
fram classes I and II) and "complex" (class IV and some 
class III) dynamics. Combined together, they fare al- 
ready much better. We note also that within the class 
of CA for which T> scales, those with largest 5 (CA 110 
and 22) are also those which have been considered most 
complex by previous authors 0, [ll[ . 

In summary, we have studied networks formed by 
states of elementary 1-d CA on finite lattices. We find 
that some of them exhibit non-trivial scaling with system 
size, and this scaling behavior can be computed analyti- 
cally in certain cases. Taken together, the the statistics of 
degree distribution (a local property) and path diversity 
(a global property) give a good indication of dynamical 
complexity. Including additional measures for network 
heterogeneity at intermediate scales could further refine 
this approach. 
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